Environmentally aware voice activity detector

نویسندگان

  • Abhijeet Sangwan
  • Nitish Krishnamurthy
  • John H. L. Hansen
چکیده

Traditional voice activity detectors (VADs) tend to be deaf to the acoustical background noise, as they (i) utilize a single operating point for all SNRs (signal-to-noise ratios) and noise types, and (ii) attempt to learn the background noise model online from finite data length. In this paper, we address the aforementioned issues by designing an environmentally aware (EA) VAD. The EA VAD scheme builds prior offline knowledge of commonly encountered acoustical backgrounds, and also combines the recently proposed competitive Neyman-Pearson (CNP) VAD with a SVM (support vector machine) based noise classifier. In operation, the EA VAD obtains accurate noise models of the acoustical background by employing the noise classifier and its prior knowledge of the noise type, and thereafter uses this information to set the best operating point and initialization parameters for the CNP VAD. The superior performance of the EA VAD scheme over the standard AMR (adaptive multi-rate) VADs in low SNR is confirmed in a simulation study, where speech and noise data were drawn from the SWITCHBOARD and NOISEX databases. We report an absolute improvement of 10-15% in detection rates over AMR VADs in low SNR for different noise types.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

Context- and cost-aware feature selection in ultra-low-power sensor interfaces

This paper introduces the use of machine learning to improve efficiency of ultra-low-power sensor interfaces. Adaptive feature extraction circuits are assisted by hardware embedded learning to dynamically activate only most relevant features. This selection is done in a context and power cost-aware way, through modification of the C4.5 algorithm. Furthermore, context dependence of different fea...

متن کامل

Methodology for the Design of a Robust Voice Activity Detector for Speech Enhancement

We propose a general methodology to design a robust voice activity detector that suits the needs of the speech enhancement system it is dedicated to. More than imposing rules, we initiate ideas on how to perform the analysis of the requirements for the Voice Activity Detection (VAD) and how to choose a reference, and evaluate the performances of the explored solutions in order to choose the one...

متن کامل

Voice activity detection using source separation techniques

A novel Voice Activity Detector is presented that is based on Source Separation techniques applied to single sensor signals. It ooers very accurate estimation of the endpoints in very low Signal to Noise ratio conditions, while maintaining low complexity. Since the procedure is totally iterative, it is suitable for use in real-time applications and is capable of operating in dynamically adaptin...

متن کامل

Spectral entropy-based voice activity detector for videoconferencing systems

This paper proposes a statistical voice activity detector (VAD) suitable for videoconferencing applications, where detection of higher level speech activities, e.g., sentences instead of syllables, words, phrases, etc, is useful. The proposed method uses two distinct features for VAD, energy and entropy in the decorrelated domain, which are modeled as chi-square and Gaussian distributions respe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007